LSa247851_0026 .Dna -^nf 
Gcg_Geneseq_D:T19113 yY ^-(^^^^^ 

ID T19113 standard; cDNA to mRNA; 351 BP. 

AC T19113; 

DT 04-JUL-1996 (first entry) 

DE Human gene signature HUMGS00127. 

KW Gene signature; messenger RNA; mRNA; relative abundance; frequency; 

KW human; cloning; mapping; non-biased library; diagnosis; detection; 

KW cell typing; abnormal cell function; ss . 

OS Homo sapiens . 

PN WO 9514772-Al. 

PD Ol-JUN-1995. 

PF ll-NOV-1994; 94WO- JP01916 . 

PR 12-NOV-1993; 93 JP- 0355504 . 

PA (MATS/) MATSUBARA K. 

PA (OKUB/) OKUBO K. 

PI Matsubara K, Okubo K; 

DR WPI; 1995-206931/27. 

PT Identifying gene signatures in 3 * -directed human cDNA library - e.g. 

PT for diagnosis of abnormal cell function, by preparing cDNA that 

PT reflects relative abundance of corresp. mRNA in specific human 

PT tissues 

PS Claim 1; Page 304; 2245pp; Japanese, 

CC A single- stranded DNA (or its complementary strand or the corresp. 

CC double -stranded DNA) which comprises one of the 7837 "GS" sequences 

CC given in T19001-T26837 and which is able to hybridise to part of 

CC human genomic DNA, cDNA or mRNA is claimed. The GS (Gene Signature) 

CC sequences were obtained from 3 ' -directed cDNA libraries prepared 

CC from various human tissues; synthesis of cDNA was initiated from the 

CC 3 '-end of mRNA by using poly(T) as the sole primer. Since the 3'- 

CC untranslated sequence is unique to a particular mRNA species, almost 

CC all the 3 ' -oriented cDNAs hybridise with specific mRNAs . Each library 

CC is constructed so as to reflect accurately the relative abundance of 

CC different mRNAs in the particular tissue from which it was derived. 

CC The appearance frequency of a given GS in a cDNA library can be 

CC determined (esp. using primers and probes derived from the GS 

CC sequences) as a means of diagnosing abnormal cell function or for 

CC recognising different cell types. 

SQ Sequence 351 BP; 123 A; 45 C; 58 G; 121 T; 4 other; 

SCORES Initl: 1638 Initn: 1638 Opt: 1706 z-score: 597.6 E(): 5,9e-24 

98.9% identity in 349 bp overlap 

1480 1490 1500 1510 1520 1530 

Sa2 4 7 8 5 1_0 0 2 TTATCTGCCATTTAT^GTATCCAGTATTTGATCACATTATTATA^ 

Ti 9 1 1 3 GATCACATTATNATAAATAAATGAAAAAAT 

10 20 30 

1540 1550 1560 1570 1580 1590 

Sa247851 002 GATTTAATCTGTAATAAACTGGTTTATTGTGCAGTGACTGTAATATACTAGA^^ 

lllllllllllllllllllliiilllllMIIIIMIIIIIIIIIIIIIIIIIIIIIIM 

T19113 GATTTAATCTGTAATAAACTGGTTTATTGTGCAGTGACTGTAATATACTAGAGTTATAAT 



40 50 60 70 80 90 

1600 1610 1620 1630 1640 1650 

Sa247851_002 AAATTOTTTACTCTGCCTCACCAAACACATGCTAGGATATAACCCCCAAAATAAGT^ 

IMIIIMIMIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIII 

Tl 9 1 1 3 AAATTGTTTACTCTGCCTCACCAAACACATGCTAGGATATAACCCCCAAAATAAGTATTT 

100 110 120 130 140 150 

1660 1670 1680 1690 1700 1710 

S a2 4 7 8 5 1_0 0 2 AACTTTGCATTAGGTATAAAGGAGACTGGGTGCTATAATTAGATTATTTTGAGGCAGACA 

IIIIIIMMMIIIIIIIMIIMIIilMIIIIIIII IIIMIIMMIIIIIIIII 

Tl 9 1 1 3 AACTTTGCATTAGGTATAAAGGAGACTGGGTGCTATAATNAGATTATTTTGAGGCAGACA 

160 170 180 190 200 210 

1720 1730 1740 1750 1760 1770 

Sa2 4 7 8 5 1_0 0 2 GAGAGCTGTTATCCTAACTGATTTAGTATGTTCTGTAATTGAGAAAATGTTCACCAAATT 

IMIMMIIIIIIIIMIIIIIIIIIIIIilllllllllMIIIMIIIIIIIIIIII 

Tl 9 1 1 3 GAGAGCTGTTATCCTAACTGATTTAGTATGTTCTGTAATTGAGAAAATGTTCACCAAATN 

220 230 240 250 260 270 

1780 1790 1800 1810 1820 1830 

lllllllllilllillMIIIIIIMMIIIIIIIIIIIIIIMIIIIIIIIIMIIIII 

T 1 9 1 1 3 ATACTTTTTAGTGATTTACATGTACATTTTATAGGGGACATGTTCTGTGTATAGCGAATA 

280 290 300 310 320 330 

1840 1850 1860 1870 

Sa247851_002 AATAACTTTTATAGTATCAAAAAAAAAAAAAAAA 

lllllllllllllllllll 
T19113 AATAACTTTTATAGTATCACN 

340 350 
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0 GENE SIGNATURE. 



® A 3'-directed cDNA library which accurately reflects the abundance ration of mRNA in a cell has been 
prepared from various human tissues, and sequencing of the cDNAs contained in the library has be conducted 
^ to examine the incidence of each cDNA in each tissue. As each cDNA has expression information with each 
^ tissue corresponding to the mRNA concentration, these cDNAs are usable as a probe or primer for detecting cell 
^ anomoly or discriminating cells. The cloned gene can produce porteins utilizable as a medicine or the like. 
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SEQUENCE LISTING 
(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME : CHUGAI PHARMACEtJTICAL CO . , LTD . 

(B) STREET: 41-8, Takada 3 -chrome, Toshima-ku 

(C) CITY: Tokyo 

(E) COUNTRY: JAPAN 

(F) ZIP: 171 

(ii) TITLE OP INVENTION: GENE SIGNATURE 

(iii) NUMBER OF SEQUENCES: 7848 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.5 in., DS, 1.44 MB 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/ MS-DOS 

(D) SOFTWARE: MS-DOS 

(v) CURRENT APPLICATION DATA 

(A) APPLICATION NUMBER: EP 95900295.7 

(vi) ' PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: PCT/JP94/01916 

(B) FILING DATE: 11. November 1994 
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SEQ ID NO: 7844 
SEQUENCE LENGTH: 37 
SEQUENCE TYPE:iiucleic acid 
s SmNDEDNESS: single 

TOPOLOGY: linear 
SEQUENCE DESCRIPTION: 

CTCGCTCGCC CATCCTTATA CAGGCTCAGT TTTGTCT 37 

SEQ ID NO: 7845 
SEQUENCE LENGTH: 37 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
'5 TOPOLOGY: linear 

SEQUENCE DESCRIPTION: 

CTCGCTCGCC CATGTATAGG GACAGCATTT CTGAGAG 37 



SEQ ID NO: 7846 

20 

SEQUENCE LENGTH: 38 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
25 SEQUENCE DESCRIPTION: 

CTGGTTCGGC CCACCTCTGA AGGTTCCAGA ATCGATAG 38^ 



SEQ ID NO: 7847 
SEQUENCE LENGTH: 22 
SEQUENCE TYPE: nucleic acid 
STRANDEONESS: single 
TOPOLOGY: linear 
SEQUENCE DESCRIPTION: 
CCAGGGTTTT CCCAGTCACG AC 



SEQ ID NO: 7848 
SEQUENCE LENGTH: 22 
SEQUENCE TYPE:nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
SEQUENCE DESCRIPTION: 

TCACACAGGA AACAGCTATG AC 22 



50 Claims 

1. A purified single-stranded DNA. a purified single-stranded DNA complementary thereto, or a purified 
double-stranded DNA consisting of said single strands, containing all or a portion of a single-stranded 
DNA or a single-stranded DNA complementary thereto comprising any of the base sequences listed 
55 under SEQ ID NO 1-7837 and hybridizing specifically to a particular site of human genomic DNA. 

human cDNA or human mRNA. 
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2. A DNA probe consisting of a purified single-stranded DNA . a purified single-stranded DNA com- 
plementary thereto, or a purified double-stranded DNA consisting of said single strands, containing all 
or a portion of a single-stranded DNA or a single-stranded DNA complementary thereto comprising any 
of the base sequences listed under SEQ ID NO 1-7837 and hybridizing specifically to a particular site 

5 of human genomic DNA, human cDNA or human mRNA. 

3. A DNA primer consisting of a purified single-stranded DNA, a purified single-stranded DNA com- 
plementary thereto, or a purified double-stranded DNA consisting of said single strands, containing all 
or a portion of a single-stranded DNA or a single-stranded DNA complementary thereto comprising any 

10 of the base sequences listed under SEQ ID NO 1-7837 and hybridizing specifically to a particular site 

of human genomic DNA. human cDNA or human mRNA. 

4. A purified single-stranded DNA, a purified single-stranded DNA complementary thereto, or a purified 
double-stranded DNA consisting of said single strands, containing all or a portion of a single-stranded 

75 DNA or a single-stranded DNA complementary thereto, wherein said single-stranded DNA is com- 

plementary to a human mRNA containing any of the base sequences listed under SEQ ID NO 1-7837 
(wherein T is read as U) or any portion thereof at its 3* region, and hybridizing specifically to a 
particular site of human genomic DNA. human cDNA or human mRNA. 

20 5. A DNA probe consisting of a purified single-stranded DNA. a purified single-stranded DNA complemen- 
tary thereto, or a purified double-stranded DNA consisting of said single strands, containing all or a 
portion of a single-stranded DNA or a single-stranded DNA complementary thereto, wherein said single- 
stranded DNA is complementary to a human mRNA containing any of the base sequences listed under 
SEQ ID NO 1-7837 (wherein T is read as U) or any portion thereof at its 3* region, and hybridizing 

25 specifically to a particular site of human genomic DNA. human cDNA or human mRNA. 

6. A DNA primer consisting of a purified single-stranded DNA. a purified single-stranded DNA com- 
plementary thereto, or a purified double-stranded DNA consisting of said single strands, containing all 
or a portion of a single-stranded DNA or a single-stranded DNA complementary thereto, wherein said 
30 single-stranded DNA is complementary to a human mRNA containing any of the base sequences listed 

under SEQ ID NO 1-7837 (wherein T is read as U) or any portion thereof at its 3' region, and 
hybridizing specifically to a particular site of human genomic DNA. human cDNA or human mRNA. 
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Figr. 1 
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Fig. 3 
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Figr- 11 
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Hybrid cells used for Southern hybridization 
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fig. 18 
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Chromosomal mapping of each GS by Southern blot technique 
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